Channel identification and spectrum estimation for robust automatic speech recognition

نویسنده

  • Yunxin Zhao
چکیده

A feature estimation technique is proposed for speech signals that are corrupted by both additive and convolutive noises via combining channel identification with power spectrum estimation. A correlation-matching algorithm is developed for channel identification, and a Gaussian mixture density model of speech DFT spectra is formulated for estimation of speech power spectra. Cepstral features of speech are calculated from the estimated power spectra. Using the proposed method, significantly improved accuracy was achieved on speaker-independent continuous speech recognition where the speech data were corrupted by a simulated linear distortion channel and additive white noise.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

Two-pass Quantile Based Noise Spectrum Estimation

Noise spectrum estimation from a noisy speech signal forms a critical part of such applications as single channel speech enhancement and robust automatic speech recognition (ASR). The two-pass quantile based noise estimation algorithm presented in this paper has the ability to track slow changing non-stationary noise and obtains good estimates for various noise types over a wide range of SNR le...

متن کامل

Robust Iris Recognition in Unconstrained Environments

A biometric system provides automatic identification of an individual based on a unique feature or characteristic possessed by him/her. Iris recognition (IR) is known to be the most reliable and accurate biometric identification system. The iris recognition system (IRS) consists of an automatic segmentation mechanism which is based on the Hough transform (HT). This paper presents a robust IRS i...

متن کامل

On the robust incorporation of formant features into hidden Markov models for automatic speech recognition

A formant analyser is interpreted probabilistically via a noisy channel model. This leads to a robust method of incorporating formant features into hiddenMarkov models for automatic speech recognition. Recognition equations follow trivially, and Baum-Welch style re-estimation equations are derived. Experimental results are presented which provide empirical proof of convergence, and demonstrate ...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999